GOTPM: A Parallel Hybrid Particle-Mesh Treecode

Authors

  • John Dubinski
  • Juhan Kim
  • Robin Humble
Abstract

We describe a parallel, cosmological N-body code based on a hybrid scheme using the particle-mesh (PM) and Barnes-Hut (BH) oct-tree algorithm. We call the algorithm GOTPM for Grid-of-Oct-Trees-Particle-Mesh. The code is parallelized using the Message Passing Interface (MPI) library and is optimized to run on Beowulf clusters as well as symmetric multi-processors. The gravitational potential is determined on a mesh using a standard PM method with particle forces determined through interpolation. The softened PM force is corrected for short range interactions using a grid of localized BH trees throughout the entire simulation volume in a completely analogous way to P3M methods. This method makes no assumptions about the local density for short range force corrections and so is consistent with the results of the P3M method in the limit that the treecode opening angle parameter, θ → 0. The PM method is parallelized using one-dimensional slice domain decomposition. Particles are distributed in slices of equal width to allow mass assignment onto mesh points. The Fourier transforms in the PM method are done in parallel using the MPI implementation of the FFTW package. Parallelization for the tree force corrections is achieved again using one-dimensional slices but the width of each slice is allowed to vary according to the amount of computational work required by the particles within each slice to achieve load balance. The tree force corrections dominate the computational load and so imbalances in the PM density assignment step do not impact the overall load balance and performance significantly. The code performance scales well to 128 processors and is significantly better than competing methods. We present preliminary results from simulations run on different platforms containing up to N = 1G particles to verify the code.
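The variable-width slice decomposition described for the tree force corrections can be illustrated with a minimal sketch. It assumes each particle carries a scalar work estimate (for example, an interaction count from the previous step) and that slice edges are placed where the cumulative work crosses equal fractions of the total. The abstract does not state how GOTPM measures work or updates its boundaries, so the function names `balanced_slice_edges` and `slice_work`, the midpoint rule, and the work measure are all hypothetical.

```python
from bisect import bisect_left
from itertools import accumulate


def balanced_slice_edges(xs, work, n_slices, box=1.0):
    """Return n_slices + 1 non-decreasing edges along x.

    xs   : particle x-coordinates in [0, box)
    work : per-particle cost estimates (hypothetical measure, not GOTPM's)
    Each slice [edges[k], edges[k+1]) holds roughly total_work / n_slices.
    """
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    sorted_x = [xs[i] for i in order]
    cum = list(accumulate(work[i] for i in order))  # cumulative work along x
    total = cum[-1]

    edges = [0.0]
    for k in range(1, n_slices):
        target = total * k / n_slices
        # First particle at which the cumulative work reaches the target.
        j = bisect_left(cum, target)
        if j + 1 < len(sorted_x):
            # Assumed rule: put the edge midway to the next particle.
            edges.append(0.5 * (sorted_x[j] + sorted_x[j + 1]))
        else:
            edges.append(box)
    edges.append(box)
    return edges


def slice_work(xs, work, edges):
    """Sum the work of the particles that fall in each slice."""
    totals = [0] * (len(edges) - 1)
    for x, w in zip(xs, work):
        for k in range(len(edges) - 1):
            if edges[k] <= x < edges[k + 1]:
                totals[k] += w
                break
    return totals


if __name__ == "__main__":
    xs = [0.05, 0.1, 0.15, 0.2, 0.6, 0.9]
    work = [1, 1, 1, 1, 2, 2]  # particles in dense regions cost more
    edges = balanced_slice_edges(xs, work, 2)
    print(edges, slice_work(xs, work, edges))
```

In this toy input the two slices have unequal widths but equal work, which is the property the abstract attributes to the tree-correction decomposition. The equal-width slices used for the PM mass assignment would instead place the edges at uniform spacing regardless of work.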


Similar resources

Towards a Next-Generation Parallel Particle-Mesh Language

We present our previous and current work on the parallel particle-mesh language PPML—a DSL for parallel numerical simulations using particle methods and hybrid particle-mesh methods in scientific computing.


Performance Evaluation of Treecode Algorithm for N-Body Simulation Using GridRPC System

GridSolve/NetSolve is a GridRPC supported middleware for parallel programming in a heterogeneous computing environment [3]. The purpose of GridSolve/NetSolve is to bring together disparate computational resources with a view to using their aggregate power and dominating the rich supply of services supported by the emerging Grid architecture. This paper is aimed at improving the performance of t...


A portable OpenCL implementation of generic particle-mesh and mesh-particle interpolation in 2D and 3D

Hybrid particle–mesh methods provide a versatile framework for simulating discrete and continuous systems. A key component is the forward and backward interpolation of particle data to mesh nodes. These interpolations typically account for a significant portion of the computational cost of a simulation. Due to its regular compute structure, interpolation admits SIMD parallelism, and several GPU...


Proceedings of the 3rd Workshop on Domain-Specific Language Design and Implementation (DSLDI 2015)

We present our previous and current work on the parallel particle-mesh language PPML—a DSL for parallel numerical simulations using particle methods and hybrid particle-mesh methods in scientific computing.


A GPU-accelerated Boundary Element Method and Vortex Particle Method

Vortex particle methods, when combined with multipole-accelerated boundary element methods (BEM), become a complete tool for direct numerical simulation (DNS) of internal or external vortex-dominated flows. In previous work, we presented a method to accelerate the vorticity-velocity inversion at the heart of vortex particle methods by performing a multipole treecode N-body method on parallel gr...



Publication date: 2003